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Description 

Field of the present invention 

[0001] The present invention concerns new peptides and the nucleic acid molecules encoding said peptides, the 
vector comprising said nucleic acid molecules, the cells transformed by said vector, inhibitors directed against said 
peptides or said nucleic acid molecules, a pharmaceutical composition and a diagnostic and/or dosage device com- 
prising said products, and non human transgenic animals expressing the- peptides according to the invention or the 
nucleic acid molecules encoding said peptides. 

£0002] The invention further provides a method for determining ligand binding, detecting expression, screening for 
drugs binding specifically to said peptides and treatments involving the peptides orthe nucleic acid molecules according 
to the invention. 

Technological background and state of the art. 

[0003] Chemotactic cytokines, or chemokines, are small signalling proteins that can be divided in two subfamilies 
{CC- and CXC-chemokines) depending on the relative position of the first two conserved cysteines. Interleukin 8 (IL- 
8) is the most studied of these proteins, but a large number of chemokines (Regulated on Activation Normal T-cell 
Expressed and Secreted (RANTES), Monocyte Chemoattractant Protein 1 (MCP-1), Monocyte Chemoattractant Pro- 
tein 2 (MCP-2), Monocyte Chemoattractant Protein 3 (MCP-3), Growth-Related gene product a (GROa), Growth-Re- 
lated gene product fj (GRO)P, Growth-Related gene product y (GROtf, Macrophage Inflammatory Protein 1 a (MIP- 
1 a) and (3, etc.) has now been described [4]. Chemokines play fundamental roles in the physiology of acute and chronic 
inflammatory processes as well as in the pathological dysregulations of these processes, by attracting and simulating 
specific subsets of leucocytes [32J. RANTES for example is a chemoattractant for monocytes, memory T-cells and 
eosinophils, and induces the release of histamine by basophils. MCP-1, released by smooth muscle cells in arterio- 
sclerotic lesions, is considered as the factor (or one of the factors) responsible for macrophage attraction and, therefore, 
for the progressive aggravation of the lesions [4]. 

[0004] MlP-tot, MIP-1 fi and RANTES chemokines have recently been described as major HIV-suppressive factors 
produced by CD8 + T-cells [9]. CC-chemokines are also involved in the regulation of human myeloid progenetor cell 
proliferation [6, 7]. 

[0005] Recent studies have demonstrated that the actions of CC- and CXC-chemokines are mediated by subfamilies 
of G protein-coupled receptors. To date, despite the numerous functions attributed to chemokines and the increasing 
number of biologically active ligands, only six functional receptors have been identified in human. Two receptors for 
interleukin-8 (IL-8) have been described [20, 29], One (IL-8RA) binds IL-8 specifically, while the other (IL-8RB) binds 
IL-8 and other CXC-chemokines, like GRO. Among receptors binding CC-chemokines, a receptor, designated CC- 
chemokine receptor t (CCR1), binds both RANTES and MIP-1a [31 ], and the CC-chemokine receptor 2 (CCR2) binds 
MCP-1 and MCP-3 [8, 44, 15]. Two additional CC-chemokine receptors were cloned recently : the CC-chemokine 
receptor 3 (CCR3) was found to be activated by RANTES, MIP-1ct and MIP-1 £ [10]; the CC-chemokine receptor 4 
(CCR4) responds to MIP-1 , RANTES and MCP-1 [37]. In addition to these six functional receptors, a number of orphan 
receptors have been cloned from human and other species, that are structurally related to either CC- or CXC-chemokine 
receptors. These include the human BLR1 [13], EBI1 [5], LCR1 [21], the mouse MIP-1 RL1 and MIP-1 RL2 [17] and 
the bovine PPR1 [25]. Their respective Hgand(s) and function(s) are unknown at present. 

Summary of the invention 

[0006] The present invention is related to a peptide having at least an amino acid sequence which presents more 
than 80%, advantageously more than 90%, preferably more than 95%, homology with the amino acid sequence as 
represented in SEQ ID NO. 1. 

[0007] Preferably, said peptide has also at least an amino acid sequence which presents more than 80%, advanta- 
geously more than 90%, preferably more than 95%, homology with the amino acid sequence as represented in SEQ 
ID NO. 2. 

[0008] According to another embodiment of the present invention, the peptide has at least an amino acid sequence 
which presents more than 80%, advantageously more than 90%, preferably more than 95%, homology with the amino 
acid sequence as represented in SEQ ID NO. 3. 

[0009] The present invention is also related to the amino acid sequence of SEQ ID NO. 1 , SEQ ID NO. 2, SEQ ID 
NO. 3 or a portion thereof (represented in the fig. 1 ). 

[001 0] A "portion of an amino acid sequence" means one or more amino acid segments having the same or improved 
binding properties of the whole peptide according to the invention. Said portion could be an epitope which is specifically 
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binded by a ligand of the peptide which could be a known "natural ligand" of said peptide, an agonist or an analog of 
said ligand, or an inhibitor capable of competitively inhibiting the binding of said ligand to the peptide (including the 
antagonists of said ligand to the peptide). 

[0011] Specific examples of said portions of amino acid sequence and their preparation process are described in 
the publication of Rucker J. et a). (Cell, Vol. 87, pp. 437-446 (1996)) incorporated herein by reference. 
[0012] According to the invention, said portion of the amino acid sequence of the peptide according to the invention 
comprises the N-terminus segment and the first extracellular loop of the peptide. 

[0013] Therefore, according to the invention, the amino acid sequence as* represented in SEQ ID NO. 1 is the common 
amino acid sequence of SEQ ID NO. 2 and of SEQ ID NO. 3 (see also figure tj. Therefore, a first industrial application 
of said amino acid sequence is the identification of the homology between said amino acid sequence and the screening 
of various mutants encoding a different amino acid sequence than the one previously described, and the identification 
of various types of patient which may present a predisposition or a resistance to the disorders described in the following 
specification. 

[0014] Preferably, the peptide according to the invention or a portion thereof is an active CC-chemokine receptor 
[0015] Advantageously, the CC-chemokine receptor according to the invention is stimulated by the MIP-1 (J chem- 
okine at a concentration less or equal to 10 nm, and is advantageously also stimulated by the MIP-1 a or RANTES 
chemokines. However, said chemokine receptor is not stimulated by the MCP-1 , MCP-2, MCP-3, IL-8 and GROct 
chemokines. 

[0016] In addition, the peptide according to the invention or a portion thereof is also a receptor of HIV viruses or a 
portion of said HIV viruses. 

[0017] It is meant by "HIV viruses", HIV-1 or HIV-2 and all the various strains of HIV viruses which are involved in 
the development of AIDS. It is meant by a "a portion of HIV viruses", any epitope of said viruses which is able to interact 
specifically with said receptor. Among said portions of viruses which may be involved in the interaction with the peptide 
according to the invention, are peptides encoded by the ENV and GAG viruses genes. 

[0018] Preferably, said portion of HIV viruses is the glycopeptide gp120/160 (membrane-bound gp 160 or the free gp 
derived therefrom) or a portion thereof. 

[001 9] It is meant by a "portion of the glycopeptide gp1 20/1 60" any epitope, preferably an immuno-dominant epitope, 
of said glycopeptide which may interact specifically with the peptide according to the invention, such as for instance 
the V3 loop (third hypervariable domain). 

[0020] According to another embodiment of the present invention, the peptide according to the invention is an inactive 
CC-chemokine receptor. An example of such inactive CC-chemokine receptor is encoded by the amino acid sequence 
as represented in SEQ ID NO. 2, 

[0021] It is meant by an "inactive CC-chemokine receptor" a receptor which is not stimulated by any known CC- 
chemokine, especially the MIP-1J}, MIP-1 a or RANTES chemokines, 

[0022] The peptide represented in SEQ ID NO. 3 according to the invention is an inactive receptor which is not a 
receptor of HIV viruses or of a portion of said HIV viruses, which means that said inactive receptor does not allow the 
entry of said HIV viruses into a cell which presents at its surface said inactive receptor 
[0023] Advantageously, the peptide according to the invention is a human receptor. 

[0024] The present invention concerns also the nucleic acid molecule having more than 80%, preferably more than 
90%, homology with one of the nucleic acid sequences of SEQ ID NO. 1 , SEQ ID NO. 2 and SEQ ID NO. 3 shown in 
the figures 1 . 

[0025] Preferably, said nucleic acid mofecule has at least the nucleic acid sequence shown in SEQ ID NO. 1 , SEQ 
ID NO. 2 or SEQ ID NO. 3 of figure 1 or a portion thereof, 

[0026] It is meant by a "portion of said nucleic acid molecule" any nucleic acid sequence of more than 1 5 nucleotides . 
which could be used in order to detect and/or reconstitute said nucleic acid molecule or its complementary strand. 
Such portion could be a probe or a primer which could be used in genetic amplification using the PGR, ICR, NASBA 
or CPR techniques for instance. 

[0027] The present invention concerns more specifically the nucleic acid molecules encoding the peptide according 
to the invention. Said nucleic acid molecules are RNA or DNA molecules such as a cDNA molecule or a genomic DNA 
molecule. 

[0028] The present invention is also related to a vector comprising the nucleic acid molecule according to the inven- 
tion. Preferably, said vector is adapted for expression in a cell and comprises the regulatory elements necessary for 
expressing the amino acid molecule in said cell operatively linked to the nucleic acid sequence according to the inven- 
tion as to permit expression thereof. 

[0029] Preferably, said cell is chosen among the group consisting of bacterial cells, yeast cells, insect cells or mam- 
malian cells. The vector according to the invention is a plasmid, preferably a pcDNA3 plasmid, or a virus, preferably 
a baculovirus, an adenovirus or a semliki forest virus. 

[0030] The present invention concerns also the cell, preferably a mammalian cell, such as a CHO-K1 or a HEK293 
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cell, transformed by the vector according to the invention. Advantageously, said cell is non neuronal in origin and is 
chosen among the group consisting of CHO-K1 , HEK293, BHK21 , COS-7 cells. 

[0031 J The present invention also concerns the cell (preferably a mammalian cell such as a CHO-K1 cell) transformed 
by the vector according to the invention and by another vector encoding a protein enhancing the functional response 
in said cell. Advantageously, said protein is the GcciS or Gct16 (G protein, a subunit). Advantageously, said cellis the 
cell CHO-K1-pEFIN hCCR5-1/16, 

[0032] The present invention is also related to a nucleic acid probe comprising a nucleic acid molecule of at least 
15 nucleotides capable of specifically hybridising with a unique sequence included within the sequence of the nucleic 
acid molecule according to the invention. Said nucleic acid probe may be a DnA or a RNA. 

[0033] The invention concerns also an antisense oligonucleotide having a sequence capable of specifically hybrid- 
ising to an mRNA molecule encoding the peptide according to the invention so as to prevent translation of said mRNA 
molecule or an antisense oligonucleotide having a sequence capable of specifically hybridising to the cDNA molecule 
encoding the peptide according to the invention. 

[0034] Said antisense oligonucleotide may comprise chemical analogs of nucleotide or substances which inactivate 
mRNA, or be included in an RNA molecule endowed with ribozyme activity. 

[0035] Another aspect of the present invention concerns a ligand or an anti-ligand (preferably an antibody) other 
than known "natural ligands", which are chosen among the group consisting of the MIP-1f}, MIP-1 a or RANTES chem- 
okines, HIV viruses or a portion of said HIV viruses, wherein said ligand is capable of binding to the receptor according 
to the invention and wherein said anti-ligand is capable of (preferably competitively) inhibiting the binding of said known 
"natural ligand" or the ligand according to the invention to the peptide according to the invention. 
[0036] The exclusion in the above identified definition of known chemokines, HIV viruses or a portion of said HIV 
viruses, does not include variants of said "natural" viruses or said "natural" portion which may be obtained for instance 
by genetic engineering and which may mimic the interaction of said viruses and portion of sard viruses to the peptide 
according to the invention. 

[0037] Advantageously, said antibody is a monoclonal antibody which is preferably directed to an epitope of the 
peptide according to the invention and present on the surface of a cell expressing said peptide. 
[0038] Preferably, said antibody is produced by the hybridome cell AchCCR5-SAB1 A7. 

[0039] The invention concerns also the pharmaceutical composition comprising either an effective amount of the 
peptide according to the invention (in order to delude the HIV virus from the natural peptide present at the surface of 
a mammalian cell and stop the infection of said mammalian cell by the HIV virus), or an effective amount of the above 
identified described ligand and/or anti-ligand, or an effective amount of oligonucleotide according to the invention, 
effective to decrease the activity of said peptide by passing through a cell membrane and binding specifically with 
mRNA encoding the peptide according to the invention in the cell so as to prevent it translation. The pharmaceutical 
composition comprises also a pharmaceutical^ acceptable carrier, preferably capable of passing through said cell 
membrane. 

[0040] Preferably, in said pharmaceutical composition, the oligonucleotide is coupled to a substance, such as a 
ribozyme, which inactivates mRNA encoding the peptide according to the invention. 

[0041] Preferably, the pharmaceutically acceptable carrier comprises a structure which binds to a receptor on a cell 
capable of being taken up by cell after binding to the structure. The structure of the pharmaceutically acceptable carrier 
in said pharmaceutical composition is capable of binding to a receptor which is specific for a selected cell type. 
[0042] The present invention concerns also a transgenic non human mammal overexpressing (or expressing ectop- 
ically) the nucleic acid molecule encoding the peptide according to the invention. 

[0043] The present invention also concerns a transgenic non human mammal comprising an homologous recombi- 
nation knockout of the native peptide according to the invention. 

[0044] According to a preferred embodiment of the invention, the transgenic non human mammal whose genome 
comprises antisense nucleic acid complementary to the nucleic acid according to the invention is so placed as to be 
transcripted into antisense mRNA which is complementary to the mRNA encoding the peptide according to the invention 
and which hybridises to mRNA encoding said peptide, thereby reducing its translation. Preferably, the transgenic non 
human mammal according to the invention comprises a nucleic acid molecule encoding the peptide according to the 
invention and comprises additionally an inducible promoter or a tissue specific regulatory element. 
[0045] Preferably, the transgenic non human mammal is a mouse. 

[0046] The invention relates to a method for determining whether a ligand can be specifically bound to the peptide 
according to the invention, which comprises contacting a cell transfected with a vector expressing the nucleic acid 
molecule encoding said peptide with the ligand under conditions permitting binding of ligand to such peptide and de- 
tecting the presence of any such ligand bound specifically to said peptide, thereby determining whether the ligand 
binds specifically to said peptide. 

[0047] The invention relates to a method for determining whether a ligand can specifically bind to a peptide according 
to the invention, which comprises preparing a cell extract from cells transfected with a vector expressing the nucleic 
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acid molecule encoding said peptide, isolating a membrane fraction from the cell extract, contacting the ligand with 
the membrane fraction under conditions permitting binding of the ligand to such peptide and detecting the presence 
of any ligand bound to said peptide, thereby determining whether the compound is capable of specifically binding to 
said peptide. Preferably, said method is used when the ligand is not previously known. 

[0048] The invention relates to a method for determining whether a ligand is an agonist of the peptide according to 
the invention, which comprises contacting a cell transfected with a vector expressing the nucleic acid molecule encoding 
said peptide with the ligand under conditions permitting the activation of a functional peptide response from the cell 
and detecting by means of a bio-assay, such as a modification in a second messenger concentration (preferably calcium 
ions or inositol phosphates such as IP 3 ) or a modification in the cellular metabolism (preferably determined by the 
acidification rate of the culture medium), an increase in the peptide activity, thereby determining whether the ligand is 
a peptide agonist. 

[0049] The invention relates to a method for determining whether a ligand is an agonist of the peptide according to 
the invention, which comprises preparing a cell extract from cells transfected with a vector expressing the nucleic acid 
molecule encoding said peptide, isolating a membrane fraction from the cell extract, contacting the membrane fraction 
with the ligand under conditions permitting the activation of a functional peptide response and detecting by means of 
a bio-assay, such as a modification in the production of a second messenger (preferably inositol phosphates such as 
IP3}, an increase in the peptide activity, thereby determining whether the ligand is a peptide agonist. 
[0050] The present invention relates to a method for determining whether a ligand is an antagonist of the peptide 
according to the invention, which comprises contacting a cell transfected with a vector expressing the nucleic acid 
molecule encoding said peptide with the ligand in the presence of a known peptide agonist, under conditions permitting 
the activation of a functional peptide response and detecting by means of a bio-assay, such as a modification in second 
messenger concentration (preferably calcium ions or inositol phosphates such as IP 3 ) or a modification in the cellular 
metabolism (preferably determined by the acidification rate of the culture medium), a decrease in the peptide activity, 
thereby determining whether the ligand is a peptide antagonist. 

[0051] The present invention relates to a method for determining whether a ligand is an antagonist of the peptide 
according to the invention, which comprises preparing a cell extract from cells transfected with an expressing the 
nucleic acid molecule encoding said peptide, isolating a membrane fraction from the cells extract, contacting the mem- 
brane fraction with the ligand in the presence of a known peptide agonist, under conditions permitting the activation of 
a functional peptide response and detecting by means of a bio-assay, such as a modification in the production of a 
second messenger, a decrease in the peptide activity, thereby determining whether the ligand is a peptide antagonist. 
[0052] Preferably, the second messenger assay comprises measurement of calcium ions or inositol phosphates such 
as IP 3 . 

[0053] Preferably, the cell used in said method is a mammalian cell non neuronal in origin, such asCHO-K1 , HEK293, 
BHK21 , COS-7 cells. 

[0054] In said method, the ligand is not previously known. 

[0055] The invention is also related to the ligand isolated and detected by any of the preceding methods. 
[0056] The present invention concerns also the pharmaceutical composition which comprises an effective amount 
of an agonist or an antagonist of the peptide according to the invention, effective to reduce the activity of said peptide 
and a pharmaceutically acceptable carrier. 

[0057] It is meant by "an agonist or an antagonist of the peptide according to the invention", all the agonists or 
antagonists of the known "natural ligand" of the peptide as above described. 

[0058] Therefore, the previously described methods may be used for the screening of drugs to identify drugs which 
specifically bind to the peptide according to the invention, 

[0059] The invention is also related to the drugs isolated and detected by any of these methods. 
[0060] The present invention concerns also a pharmaceutical composition comprising said drugs and a pharmaceu- 
tically acceptable carrier. 

[0061] The invention is also related to a method of detecting expression of a peptide according to the invention by 
detecting the presence of mRNA coding for a peptide, which comprises obtaining total RNA or total mRNA from the 
cell and contacting the RNA or mRNA so obtained with the nucleic acid probe according to the invention under hybrid- 
ising conditions and detecting the presence of mRNA hybridised to the probe, thereby detecting the expression of the 
peptide by the cell. 

[0062] Said hybridisation conditions are stringent conditions. 

[0063] The present invention concerns also the use of the pharmaceutical composition according to the invention 
for the treatment and/or prevention of inflammatory diseases, including rheumatoid arthritis, glomerulonephritis, asth- 
ma, idiopathic pulmonary fibrosis and psoriasis, viral infections including Human Immunodeficiency Viruses 1 and 2 
(HIV-1 and 2), cancer including leukaemia, atherosclerosis and/or auto-immune disorders. 

[0064] The present invention concerns also a method for diagnosing a predisposition or a resistance to a disorder 
associated with the activity of the peptide according to the invention and/or associated with infectious agents such as 
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HIV viruses in a subject. Said method comprises : 

a) obtaining nucleic acid molecules encoding the peptide according to the invention from the cells of the subject; 

b) possibly performing a restriction digest of said nucleic acid molecules with a panel of restriction enzymes; 

c) possibly electrophoreticalty separating the resulting nucleic acid fragments on a sized gel; 

d) contacting the resulting gel or the obtained nucleic acid molecule with a nucleic acid probe labelled with a 
detectable marker and capable of specifically hybridising to said nucleic acid molecule (said hybridisation being 
made in stringent hybridisation conditions) ; 

e) detecting labelled bands or the in situ nucleic acid molecules which have hybridised to the said nucleic acid 
molecule labelled with a detectable marker to create a unique band pattern or an in situ marking specific to the 
subject; 

f) preparing other nucleic acid molecules encoding the peptide according to the invention obtained from the cells 
of other patients for diagnosis by step a-e; and 

g) comparing the unique band pattern specific to the nucleic acid molecule of subjects suffering from the disorder 
from step e and the nucleic acid molecule obtained for diagnosis from step f to determine whether the patterns 
are the same or different and to diagnose thereby a predisposition or a resistance to the disorder if the patterns 
are the same or different. 

[0065] The present invention is also related to a method for diagnosing a predisposition or a resistance to a disorder 
associated with the activity of a specific allele of the peptide according to the invention or the presence of said peptide 
at the surface of cells and/or associated with infectious agents such as HIV viruses present in a subject. Said method 
comprises ; 

a) obtaining a sample of a body fluid, preferably a blood sample comprising antigen presenting cells, from a subject; 

b) adding to said sample a ligand and/or an anti-ligand according to the invention; 

c) detecting the cross-reaction between said ligand and/or said anti-ligand and the specific peptide according to 
the invention; and 

d) determining whether the peptide corresponds to a receptor or an inactive receptor according to the invention 
and diagnosing thereby a predisposition or a resistance to the disorder according to the type of the peptide present 
in the body fluid of the subject. 

[0066] The present invention concerns also a diagnostic and/or dosage device, preferably a kit, comprising the pep- 
tides, the nucleic acid molecules, the nucleic acid probes, the ligands and/or the anti-ligands according to the invention, 
their portions (such as primers, probes, epitopes, ...) or a mixture thereof, being possibly labelled with a detectable 
marker. 

[0067] Said diagnostic and/or dosage device comprises also the reactants for the detection and/or the dosage of 
antigens, antibodies or nucleic acid sequences through a method selected from the group consisting of in situ hybrid- 
isation, hybridisation or recognition by marked specific antibodies, specially ELISA® (Enzyme Linked Immunosorbent 
Assay) or RIA® (Radio Immunoassay), methods on filter, on a solid support, in solution, in "sandwich", on gel, by Dot 
blot hybridisation, by Northern blot hybridisation, by Southern blot hybridisation, by isotopic or non-isotopic labelling 
(such as immunofluorescence or biotlnylation), by a technique of cold probes, by genetic amplification, particularly 
PCR, LCR, NASBA or CPR, by a double immunodiffusion, by a counter-immunoelectrophoresis, by haemagglutination 
and/or a mixture thereof. 

[0068] A last aspect of the present invention concerns a method of preparing peptides according to the invention, 
which comprises : * 

a) constructing a vector adapted for expression in a cell which comprises the regulatory elements necessary for 
the expression of nucleic acid molecules in the cell operatively linked to nucleic acid molecule encoding said 
peptide so as to permit expression thereof, wherein the celt is preferably selected from the group consisting of 
bacterial cells, yeast cells, insect cells and mammalian cells; 

b) inserting the vector of step a in a suitable host cell; 

c) incubating the cell of step b under conditions allowing the expression of the peptide according to the invention; 

d) recovering the peptide so obtained; and 

e) purifying the peptide so recovered, thereby preparing an isolated peptide according to the invention. 

[0069] The deposits of micro-organisms AchCCR5-SAB1 A7 and CHO-K1 -pEFIN hCCR5-1/1 6 were made according 
to the Budapest Treaty in the Belgium Coordinated Collection of Micro-organisms (BCCM), Laboratorium voor Molec- 
ulaire Biologie (LMBP), Universiteit Gent, K. L Ledeganckstraat 35, B-9000 GENT, BELGIUM. 
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Short description of the drawings. 

[0070) The figure 1 represents the primary structure of the peptides according to the invention. 
[0071] The figure 2 represents the amino acids sequence of the active human CCR5 chemokine receptor according 
to the invention aligned with that of the human CCR1 , CCR2b, CCR3 and CCR4 receptors. Amino acids identical with 
the active CCR5 sequence are boxed. 

[0072] The figure 3 shows the chromosomal organisation of the human CCR2 and CCR5 chemokine receptor genes. 
[0073] The figure 4 shows the functional expression of the human active CCR5 receptor in a CHO-K1 cell line. 
[0074] The figure 5 represents the distribution of mRNA encoding the CCR5 receptor in a panel of human cell lines 
of haematopoietic origin, 

[0075] The figure 6 represents the structure of the mutant form of human CCR5 receptor 

[0076] The figure 7 represents the quantification of ENV proteins-mediated fusion by lucif erase assays. 

[0077] The figure 8 represents genotyping of individuals by PCR and segregation of the CCR5 alleles in CEPH 

families. 

[0078] The figure 9 represents the FACS analysis of sera anti-CCRS on a CCR5-CHO cell line according to the 
invention. 

[0079] The figure 10 represents the inhibition of HIV infectivity with anti-CCRS antibodies. 
Detailed description of the Invention. 
1 . EXPERIMENTALS 

Materials 

[0080] Recombinant human chemokines, including MCP-1, MlP-1a, MIP-10, RANTES, IL-B and GROct were ob- 
tained from R&D Systems (London, UK). [ 125 l]MIP-1a (specific activity, 2200 Ci/mmol) was obtained from Dupont 
NEN (Brussels, Belgium). Chemokines obtained from R&D Systems were reported by the supplier as >97 % pure on 
SDS-PAGE (sodium dodecyl sulphate-polyacrylamide gel electrophoresis) and biologically active on a bioassay spe- 
cific for each ligand. The lyophilised chemokines were dissolved as a 1 00 ng/ml solution in a sterile phosphate-buffered 
saline (PBS) and this stock solution was stored at -20° C in aliquots. Chemokines were diluted to the working concen- 
tration immediately before use. All cell lines used in the present study were obtained from the ATCC (Rockville, MD, 
USA). 

Cloning and sequencing 

[0081 ] The mouse MOP020 clone was obtained by low stringency polymerase chain reaction, as described previously 
[24, 34], using genomic DNA as template. A human genomic DNA library (Stratagene, La Jolla, CA) constructed in the 
lambda DASH vector was screened at low stringency [39] with the MOP020 (511 bp) probe. The positive clones were 
purified to homogeneity and analysed by Southern blotting. The restriction map of the locus was determined and a 
relevant Xbal fragment of 4,400 bp was subcloned in pBluescript SK+ (Stratagene). Sequencing was performed on 
both strands after subcloning in M13mp derivatives, using fluorescent primers and an automated DNA sequencer 
(Applied Biosystem 370A). Sequence handling and data analysis was carried out using the DNASIS/PROSIS software 
(Hitachi), and the GCG software package (Genetics Computer Group, Wisconsin). 

Expression in ceil lines 

[0082] The entire coding region was amplified by PCR as a 1 056 bp fragment, using primers including respectively 
the BamH\ and Xbal recognition sequences, and cloned after restriction in the corresponding sites of the eukaryotic 
expression vector pcDNA3 (Invitrogen, Sah Diego, CA). The resulting construct was verified by sequencing, andtrans- 
fected in CHO-K1 ceils as described [35]. Two days after transfection, selection for stably transfected cell lines was 
initiated by the addition of 400 jig/ml G41 8 (Gibco), and resistant clones were isolated at day 1 0. CHO-K1 cells were 
cultured using Ham's F12 medium, as previously described [35, 11]. The expression of the active CCR5 receptor in 
the various cell clones was evaluated by measuring the specific transcript level by Northern blotting, on total RNA 
prepared from the cells (see below). 

Binding Assays 



[0083] Stably transfected CHO-K1 cells exp ressing the active CCR5 receptor were grown to confluence and detached 
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from culture dishes by incubation in phosphate-buffered saline (PBS) supplemented with t mM EDTA. Cells were 
collected by low speed centrifugation and counted in a Neubaeur cell. Binding assays were performed in polyethylene 
minisorp tubes (Nunc) in a final volume of 200 uJ PBS containing 0.2 % bovine serum albumin (BSA) and 1 0 s cells, in 
presence of [ 125 IJ -MIP-1ct. Non specific binding was determined by addition of 10 nM unlabelled MIP-1a. The concen- 
tration of labelled ligand was 0.4 nM (around 100 000 cpm per tube) . The incubation was carried out for 2 hours at 4 
°C, and was stopped by the rapid addition of 4 ml ice-cold buffer, and immediate collection of cells by vacuum filtration 
through GF/B glass fiber filters (Whatmann) pre-soaked in 0.5 % polyethyleneinimine (Sigma). Filters were washed 
three times with 4 ml ice-cold buffer and counted in a gamma counter 

Biological activity 

[0084] The CHO-K1 cell lines stably transfected with the pcDNA3/CCR5 construct or wild type CHO-K1 cells (used 
as controls) were plated onto the membrane of Transwell cell capsules (Molecular Devices), at a density of 2.5 10 5 
cells/well in Ham's F12 medium. The next day, the capsules were transferred in a microphysiometer (Cytosensor, 
Molecular Devices), and the celts were allowed to equilibrate for approximately two hours by perfusion of 1 mM phos- 
phate-buffered (pH 7.4) RPMI-1640 medium containing 0.2 % BSA. Cells were then exposed to various chemokines 
diluted in the same medium, for a 2 min duration. Acidification rates were measured at one minute intervals. 

Northern blotting 

[0085] Total RNA was isolated from transfected CHO-K1 cell lines, from a panel of human cell lines of haematopoietic 
origin and from a panel of dog tissues, using the RNeasy kit (Qiagen). RNA samples (1 0 \ig per lane) were denatured 
in presence of glyoxal [26], fractionated on a 1 % agarose gel in a 10 mM phosphate buffer (pH 7.0), and transferred 
to nylon membranes (Pall Biodyne A, Glen Cove, NY) as described [42]. After baking, the blots were prehybridised for 
4h at 42° C in a solution consisting of 50 % formamide, 5x Denhardt solution (1x Denhardt: 0.02 % Ficoll, 0.02 % 
polyvinylpyrolidone, 0.02 % BSA), 5x SSPE (1x SSPE: 0.18 M NaCI, 10 mM Na phosphate, 1 mM EDTA pH 8.3), 0.3 
% Sodium Dodecyl Sulphate (SDS), 250 \ig per ml denatured DNA from herring testes. DNA probes were (ot 32 P)- 
labelled by random priming [14]. Hybridisations were carried out for 12h at 42° C in the same solution containing 10 
% (wt/vol) dextran sulphate and the heat denatured probe. Filters were washed up to 0.1 x SSC (1x SSC: 150 mM 
NaCI, 15 mM Na Citrate pH 7.0), 0.1 % SDS at 60° C and autoradiographed at - 70° C using Amersham p-max films. 

2. RESULTS AND DISCUSSION 

Cloning and structural analysis 

[0086] The sequence homology characterising genes encoding G protein-coupled receptors has allowed the cloning 
by low stringency polymerase chain reaction (PCR) of new members of this gene family [24, 34]. One of the clones 
amplified from mouse genomic DNA, named MOP020 presented strong similarities with characterised chemokine re- 
ceptors, sharing 80 % identity with the MCP-1 receptor (CCR2) [8], 65 % identity with the MIP-1otfRANTES receptor 
(CCR1) [31], and 51 % identity with IL-8 receptors [20, 30]. The clone was used as a probe to screen a human genomic 
library. A total of 16 lambda phage clones were isolated. It was inferred from the restriction pattern of each clone and 
from partial sequence data that all clones were belonging to a single contig in which two different coding sequences 
were included. One of the coding sequences was identical to the reported cDNA encoding the CCR2 receptor [8, 44]. 
A 4.400 pb Xbal fragment of a representative clone containing the second region of hybridisation was subcloned in 
pBluescript SK+. Sequencing revealed a novel gene, tentatively named CCR5, sharing 84 % identity with the MOP020 
probe, suggesting that MOP020 is the mouse ortholog of CCR5. MOP020 does not correspond to any of the three 
mouse chemokine receptor genes cloned recently [16], demonstrating the existence of a fourth murine chemokine 
receptor. 

[0087] The sequence of CCR5 revealed a single open reading frame of 352 codons encoding a protein of 40,600 
Da. The sequence surrounding the proposed initiation codon is in agreement with the consensus as described by 
Kozak [22], since the nucleotide in -3 is a purine. The hydropathy profile of the deduced amino acid sequence is 
consistent with the existence of 7 transmembrane segments. Alignment of the CCR5 amino acid sequence with that 
of other functionally characterised human CC-chemokine receptors is represented in figure 2. The highest similarity is 
found with the CCR2 receptor [8] that shares 75.8 % identical residues. There is also 56.3 % identity with the CCR1 
receptor [31], 58.4 % with the CCR3 [10], and 49.1% with the CCR4 [37]. CCR5 represents therefore a new member 
of the CC-chemokine receptor group [30]. Like the related CCR1 and IL-8 receptors [20, 29, 31 , 16] the coding region 
of CCR5 appears as intronless. From our partial sequencing data, the CCR2 gene is also devoid of intron in the first 
two thirds of its coding sequence. 
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[0068] Sequence similarities within the chemokine receptor family are higher in the transmembrane-spanning do- 
mains, and in intracellular loops. As an example, the identity score between CCR5 and CCR2 goes up to 92% when 
considering the transmembrane segments only. Lower similarities are found in the N-terminal extracellular domain, 
and in the extracellular loops. The N-terminal domain of the IL-8 and CCR2 receptors has been shown to be essential 
for interaction with the ligand [19, 18]. The variability of this region among CC-chemokine receptors presumably con- 
tributes to the specificity towards the various ligands of the family. 

[0089] A single potential site for N-linked glycosylation was identified in the third extracellular loop of CCR5 (figure 
1). No glycosylation site was found in the N-terminal domain of the receptor, where most G protein-coupled receptors 
are glycosylated. The other chemokine receptors CCR1 and CCR2 present such an N-linked glycosylation site in their 
N-terminal domain [31, 8]. By contrast, the CCR3 receptor [10] does not display glycosylation sites neither in the N- 
terminus, nor in extracellular loops. The active CCR5 receptor has four cysteines in its extracellular segments, and all 
four are conserved in the other CC- and CXC-chemokine receptors (figure 2). The cysteines located in the first and 
second extracellular loops are present in most G protein-coupled receptors, and are believed to form a disulphide 
bridge stabilising the receptor structure [41 ]. The two other cysteines, in the N-terminal segment, and in the third 
extracellular loop could similarly form a stabilising bridge specific to the chemokine receptor family. The intracellular 
domains of CCR5 do not include potential sites for phosphorylation by protein kinase C (PKC) or protein kinase A. 
PKC sites, involved in heterologous desensitisation are frequent in the third intracellular loop and C -terminus of G 
protein-coupled receptors. CCR1 is also devoid of PKC sites. In contrast, all CC-chemokine receptors, are rich in serine 
and threonine residues in the C-terminal domain. These residues represent potential phosphorylation sites by the 
family of G protein-coupled receptor kinases, and are probably involved in homologous desensitisation [41]. Five of 
these S/T residues are perfectly aligned in all five receptors (figure 2). 

Physical linkage of the CCR5 and CCR2 genes 

[0090] As stated above, the 16 clones isolated with the MOP020 probe corresponded to a single contig containing 
the CCR5 and CCR2 genes. The organisation of this contig was investigated in order to characterise the physical 
linkage of the two receptor genes in the human genome. A combination of restriction mapping, Southern blotting, 
fragment subcloning and partial sequencing allowed to determine the respective borders and overlaps of all clones. 
Out of the 16 clones, 9 turned out to be characterised by a specific restriction map, and their organisation is depicted 
in figure 3. Four of these clones (#11,18,21, 22) contained the CCR2 gene alone, four clones (# 7, 1 3, 1 5, 1 6) contained 
the ChemR13 gene alone and one clone (#9) contains part of both coding sequences. The CCR2 and CCR5 genes 
are organised in tandem, CCR5 being located downstream of CCR2. The distance separating CCR2 and CCR5 open 
reading frames is 17,5 kb. The chromosomal localisation of the tandem is presently unknown. Other chemokine re- 
ceptors have however been located in the human genome: the CCR1 gene was localised by fluorescence in situ 
hybridisation to the p21 region of human chromosome 3 [1 6]. The two IL-8 receptor genes, and their pseudogene have 
been shown to be clustered on the human 2q34-q35 region [1]. 

Functional expression and pharmacology of the active CCR5 receptor 

[0091] Stable CHO-K1 cell lines expressing the active CCR5 receptor were established and were screened on the 
basis of the level of CCR5 transcripts as determined by Northern blotting. Three clones were selected and tested for 
biological responses in a microphysiometer, using various CC- and CXC-chemokines as potential agonists. Wild type 
CHO-K1 ceils were used as control to ensure that the observed responses were specific for the transfected receptor, 
and. did not result from the activation of endogenous receptors. The microphysiometer allows the real time detection 
of receptor activation, by measuring the modifications of cell metabolism resulting from the stimulation of intracellular 
cascades [33]. Several studies have already demonstrated the potential of microphysiometry in the field of chemokine 
receptors. Modifications of metabolic activity in human monocytes, in response CC-chemokines, were monitored using 
this system [43]. Similarly, changes in the acidification rate of THP-1 cells (a human monocytic cell line) in response 
to MCP-1 and MCP-3 have been measured [36]. The estimation of the EC 50 for both proteins, using this procedure, 
was in agreement with the values obtained by monitoring the intracellular calcium in other studies [8, 15]. 
[0092] Ligands belonging to the CC- and CXC-chemokine classes were tested on the CCR5 transfected CHO-K1 
cells. Whereas MIP-1a, MIP-1J3 and R ANTES were found to be potent activators of the new receptor (figure 4), the 
CC-chemokines MCP-1 , MCP-2 and MCP-3, and the CXC-chemokines GROa and IL-8 had no effect on the metabolic 
activity, even at the highest concentrations tested (30 nM). The biological activity of one of the chemokines inducing 
no response on CCR5 (IL-8) could be demonstrated on a CHO-K1 cell line transfected with the IL-8A interleukin receptor 
(Moliereau et al.. 1 993) : IL-8 produced a 1 60 % increase in metabolic activity as determined using the microphysiom- 
eter. The biological activity of the MCP-2 and MCP-3 preparations as provided by J. Van Damme have been widely 
documented [2, 40], M IP-1 a, MIP-1 p and RANTES were tested on the wild type CHO-K1 cells, at a 30 nM-concentration, 
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and none of them induced a metabolic response. On the CCR5 transfected CHO-K1 cell line, all three active ligands 
(MIP-1a, MIP-1J3 and RANTES) caused a rapid increase in acidification rate, reaching a maximum by the second or 
third minute after perfusion of the ligand. The acidification rate returned to basal level within 10 minutes. The timing of 
the cellular response is similar to that observed for chemokines on their natural receptors in human monocytes [43], 
When agonists were applied repeatedly to the same cells, the response was strongly reduced as compared to the first 
stimulation, suggesting the desensitisation of the receptor. All measurements were therefore obtained on the first stim- 
ulation of each capsule. 

[0093] The concentration-effect relation was evaluated for the three active lipands in the 0.3 to 30 nM range (figure 
3B and C). The rank order of potency was MlP-1ct > M1P-10 = RANTES. At.30 nM concentrations, the effect of MIP- 
1a appeared to saturate (at 156 % of baseline level) while MIP-1p and RANTES were still in the ascending phase. 
Higher concentrations of chemokines could however not be used. The EC50 was estimated around 3 nM for MIP-1a. 
The concentrations necessary for obtaining a biological response as determined by using the microphysiometer are 
in the same range as those measured by intracellular calcium mobilisation for the CCR1 [31], the CCR2A and B [8], 
and the CCR3 [10] receptors. The ligand specificity of CCR5 is similar to that reported for CCR3 [10]. CCR3 was 
described as the first cloned receptor responding to MIP-1J3. However, MIP-10 at 10 nM elicits a significant effect on 
the CCR5, while the same concentration is without effect on the CCR3 transfected cells [10]. These data suggest that 
CCR5 could be a physiological receptor for MIP-1 [J. 

[0094] Binding experiments using [ 125 l]-human MIP-1a as ligand did not allow to demonstrate specific binding to 
CCR53 expressing CHO-K1 cells, using as much as 0.4 nM radioligand and 1 million transfected cells per tube. Failure 
to obtain binding data could be attributed to a relatively low affinity of the receptor for MIP-1 a. 

Northern blotting analysis 

[0095] Northern blotting performed on a panel of dog tissues did not allow to detect transcripts for CCR5. Given the 
role of the chemokine receptor family in mediating chemoattraction and activation of various classes of cells involved 
in inflammatory and immune responses, the probe was also used to detect specific transcripts in a panel of human 
cell lines of haematopoietic origin (figure 5). The panel included lymphoblastic (Raji) and T lymphoblastic (Jurkat) cell 
lines, promyelobtastic (KG-1 A) and promyelocyte (HL-60) ceil lines, a monocytic (THP-1 ) cell line, an erythroleukemia 
(HEL 92.1.7) cell line, a megakaryoblastic (MEG-01) cell line, and a myelogenous leukaemia (K-562) cell line. Human 
peripheral blood mononuclear cells (PBMC), including mature monocytes and lymphocytes, were also tested. CCR5 
transcripts (4.4 kb) could be detected only in the KG-1 A promyeloblasrjc cell line, but were not found in the promyelocytic 
cell line HL-60, in PBMC, or in any of the other cell lines tested. These results suggest that the active CCR5 receptor 
could be expressed in precursors of the granulocytic lineage. CC-chemokines have been reported to stimulate mature, 
granulocytes [27, 38, 23, 2]. However, recent data have also demonstrated a role of CC- and CXC-chemokines in the 
regulation of mouse and human myeloid progenitor cell proliferation [6, 7]. 

[0096] CCR5 was shown to respond to MlP-1a, MIP-1 0 and RANTES, the three chemokines identified as the major 
HlV-suppressive factors produced by CD8 + T cells [9], and released in higher amounts by CD4+ T lymphocytes from 
uninfected butmultiply exposed individuals [51]. CCR5 represents a major co-receptor for macroph age-tropic (M-tropic) 
HIV-1 primary isolates and strains [45, 50]. M-tropic strains predominate during the asymptomatic phase of the disease 
in infected individuals, and are considered as responsible for HIV-1 transmission. Strains adapted for growth in trans- 
formed T-cell lines (T-tropic strains) use as a co-receptor LESTR (or fusin) [50], an orphan receptor also belonging to 
the chemokine receptor family, but not yet characterised functionally [21 , 52, 53]. Dual-tropic viruses, which may rep- 
resent transitional forms of the virus in late stages of infection [54] are shown to use both CCR5 and LESTR as co- 
receptors, as well as the CC-chemokine receptors CCR2b and CCR3 [47]. The broad spectrum of co-receptor usage 
of dual-tropic viruses suggests that within infected individuals, the virus may evolve at least in part from selection by 
a variety of co-receptors expressed on different cell types. 

Identification of an inactive &CCR5 receptor 

[0097] It is known that some individuals remain uninfected despite repeated exposure to HIV-1 [55, 56, 51]. A pro- 
portion of these exposed-uninfected individuals results from the relatively low riskof contamination after a single contact 
with the virus, but it has been postulated that truly resistant individuals do exist. In fact, CD4 + lymphocytes isolated 
from exposed-uninfected individuals are highly resistant to infection by primary M-tropic, but not T-tropic HIV-1 strains. 
Also, peripheral blood mononuclear cells (PBMC) from different donors are not infected equally with various HIV-1 
strains [57-59]. Given the key role played by CCR5 in the fusion event that mediates infection by M-tropic viruses, it 
is postulated that variants of CCR5 could be responsible for the relative or absolute resistance to HIV-1 infection 
exhibited by some individuals, and possibly for the variability of disease progression in infected patients [66]. The 
Inventors selected three HIV-1 infected patients known to be slow progressors, and four seronegative individuals as 
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controls; the full coding region of their CCR5 gene was amplified by PCR and sequenced. Unexpectedly, one of the 
slow progressors : but also two of the uninfected controls, exhibited heterozygosity at the CCR5 locus for a biallelic 
polymorphism. The frequent allele corresponded to the published CCR5 sequence, while the minor one displayed a 
32 bp deletion within the coding sequence, in a region corresponding to the second extracellular loop of the receptor 
(Fig. 6). The figure 6 is the structure of the mutant form of human CC-chemokine receptors, a, The amino acid sequence 
of the non-functional AccrS protein is represented. The transmembrane organisation is given by analogy with the pre- 
dicted transmembrane structure of the wild-type CCR5. Amino acids represented in black correspond to unnatural 
residues resulting from the frame shift caused by the deletion. The mutant'protein lacks the last three transmembrane 
segments of CCR5, as well as the regions involved in G protein-coupling, b,. Nucleotide sequence of the CCR5 gene 
surrounding the deleted region, and translation into the normal receptor (top) or the truncated mutant ( ccrS, bottom). 
The 10-bp direct repeat is represented in italics. The full size coding region of the CCRS gene was amplified by PCR, 
using S'-TCGAGGATCCAAGATGGATTATCAAGT-3' and 5'-CTGATCTAGAGCCATGTGCACAACTCT-3' as forward 
and reverse primers respectively. The PCR products were sequenced on both strands using the same oligonucleotides 
as primers, as welt as internal primers, and fluorochrome-labelled dideoxy nucleotides as terminators. The sequencing 
products were run on an Applied Biosystem sequencer, and ambiguous positions were searched along the coding 
sequence. When the presence of a deletion was suspected from direct sequencing, the PCR products were cloned 
after restriction with BamHt and Xbal endonucleases into pcDNA3. Several clones were sequenced to confirm the 
deletion. The deletion was identical in three unrelated individuals investigated by sequencing. * 
[0098] Cloning of the PCR product and sequencing of several clones confirmed the deletion. The deletion causes a 
frame shift, which is expected to result in premature termination of translation. The protein encoded by this mutant 
allele (AccrS) therefore lacks the last three transmembrane segments of the receptor. A 10-bp direct repeat flanking 
the deleted region (Fig. 6b) on both sides is expected to have promoted the recombination event leading to the deletion. 
Numerous mutagenesis studies performed on various classes of G protein-coupled receptors, including chemokine 
receptors, makes it clear that such a truncated protein is certainly not functional in terms of chemokine-induced signal 
transduction: it lacks the third intracellular loop and C-terminal cytoplasmic domains, the two regions involved primarily 
in G protein coupling [41]. In order to test whether the truncated protein was able to function as a HIV-1 co-receptor, 
the Inventors tested its ability to support membrane fusion by both primary M-tropic and dual-tropic virus ENV proteins. 
The recombinant protein was expressed in quail QT6 ceils together with human CD4. The QT6 cells were then mixed 
with HeLa cells expressing the indicated viral ENV protein and the extent of cell-cell fusion measured using a sensitive 
and quantitative gene-reporter assay. In contrast to wild-type CCR5, the truncated receptor did not allow fusion with 
cells expressing the ENV protein from either M-tropic or dual-tropic viruses (Figure 7). The figure 7 represents the 
quantification of ENV protein-mediated fusion by lucif erase assay. To quantify cell-cell fusion events, Japanese quail 
QT6 fibrosarcoma cells were transfected or cotransfected as indicated with the pcDNA3 vector (Invitrogen) containing 
the coding sequence for wild-type CCRS, the truncated ccrS mutant, the CCR2b or the Duffy chemokine receptors, or 
with the pCDNA3 vector alone. The target cells were also transfected with human CD4 expressed from the CMV 
promoter and the luciferase gene under the control of the T7 promoter. HeLa effector cells were infected (MOI 10) 
with vaccinia vectors expressing T7-polymerase (vTF1 .1 ) and either the JR-FL (vCB28) or 89.6 (v8D3) envelope pro- 
teins. The luciferase activity resulting from cell fusion is expressed as the percentage of the activity (in relative light 
units) obtained for wild-type CCRS. All transfectlons were performed with an identical quantity of plasmid DNA using 
pcDNA3 as carrier when necessary. To initiate fusion, target and effector cells were mixed in 24 well plates at 37 °C 
in the presence of ara-C and rifamptcin, and allowed to fuse for 8 hours. Cells were lysed in 150 u,l of reporter lysis 
buffer (Promega) and assayed for luciferase activity according to the manufacturer's instructions (Promega). 
[0099] Coexpression of AccrS with wild-type CCRS consistently reduced the efficiency of fusion for both JR-FL and 
89.6 envelopes, as compared with CCRS alone. Whether this in vitro inhibitory effect (not shared by the chemokine 
receptor Duffy, used as control) also occurs in vivo is presently not known. Coexpression with the CCR2b receptor 
[31], which is the CC-chemokine receptor most closely related to CCRS but does not promote fusion by M-tropic HIV- 
1 strains [48], did not rescue the mutation by formation of a hybrid molecule (Fig. 7). 

[0100] The figure 8 represents genotyping of individuals by PCR and segregation of the CCRS alleles in CEPH 
families, a, Autoradiography illustrating the pattern resulting from PCR amplification and EcoRI cleavage for individuals 
homozygous for the wild-type CCRS allele (CCR5/CCR5), the null AccrS allele (AccrS/AccrS), and for heterozygotes 
(CCR5/Accr5). A 735 bp PCR product is cleaved into a common band of 332 bp for both alleles, and into 403 and 371 
bp bands for the wild-type and mutant alleles, respectively. b t Segregation of the CCRS alleles in two informative 
families of the CEPH. Half -black and white symbols represent heterozygotes and wild-type homozygotes, respectively 
For a few individuals in the pedigrees t DNA was not available (ND: not determined). PCRs were performed on genomic 
DNA samples, using 5'-CCTGGCTGTCGTCCATGCTG-3' and 5'-CTGATCTAGAGCCATGTGCACAACTCT-3* as for- 
ward and reverse primers respectively. Reaction mixtures consisted in 30 nl of 1 0 mM Tris-HCI buffer pH 8.0, containing 
50 mM Kcl, 0.75 mM MgCI 2 , 0.2 mM dCTP, dGTP and dTTR 0.1 mM dATP, 0.5 uJ [cc- 32 P]-dATP, 0.01% gelatine.. 5% 
DMSO, 200 ng target DNA, 60 ng of each of the primers and 1 .5 U Taq polymerase. PCR conditions were: 93 °C for 
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2 min 30; 93 °C for 1 min, 60 °C for 1 min, 72 °C for 1 min, 30 cycles; 72 °C for 6 min. After the PCR reaction, the 
samples were incubated for 60 min at 37 °C with 10 U EcoR\, and 2 pJ of the denatured reaction mixture was applied 
onto a denaturing 5% polyacrylamide gel containing 35% formamide and 5.6 M urea. Bands were detected by auto- 
radiography. 

[0101] Based on the 14 chromosomes tested in the first experiment, the deleted AccrS allele appeared rather frequent 
in the Caucasian population. The accurate frequency was further estimated by testing (Fig. 8a) a large cohort of Cau- 
casian individuals, including unrelated members of the CEPH (Centre d'Etude des Polymorphismes Humains) families, 
part of the IRIBHN staff, and a bank of anonymous DNA samples from healthy individuals collected by the Genetics 
Department of the Erasme Hospital in Brussels. From a total of more than 700 healthy individuals, the allele frequencies 
were found to be 0.908 for the wild-type allele, and 0.092 for the mutant allele (Table I). The genotype frequencies 
observed in the population were not significantly different from the expected Hardy- Weinberg distribution (CCR5/CCR5: 
0.827 vs 0.824; CCRS/AccrS: 0.162 vs 0.167; Accr5/Accr5: 0.011 vs 0.008, p > 0.999), suggesting that the null allele 
has no drastic effect on fitness. Using two informative CEPH families, it was confirmed that the wild-type CCR5 gene 
and its AccrS variant were allelic, and segregated in a normal mendelian fashion (Fig. 8b). Interestingly, a cohort of 
124 DNA samples originating from Central Africa (collected from Zaire, Burkina Fasso, Cameroun, Senegal and Benin) 
and Japan did not reveal a single Accr5 mutant allele, suggesting that this allele is either absent or very rare in Asian, 
African black populations (Table I). 

[0102] The consequences of the existence of a null allele of CCR5 in the normal Caucasian population were then 
considered in terms of susceptibility to infection by HIV-1. If, as it is predicted, CCR5 plays a major (not redundant) 
role in the entry of most primary virus strains into cells, then AccrS/AccrS individuals should be particularly resistant to 
HIV-1 challenge, both in w'froand in vivo. The frequency of the AccrS/AccrS genotype should therefore be significantly 
lower in HIV-1 infected patients, and increased in exposed-uninfected individuals. Also, if heterozygotes have a sta- 
tistical advantage due to the lower number of functional receptors on their white blood cells, or to the possible dominant- 
negative properties of the mutant allele, the frequency of heterozygotes (and mutant alleles) should be decreased in 
HIV-infected populations. These hypotheses were tested by genotyping a large number of seropositive Caucasian 
individuals (n = 645) belonging to cohorts originating from various hospitals from Brussels, Liege and Paris (Table I). 
Indeed, it was found that within this large series, the frequency of the null AccrS allele was significantly reduced from 
0.092 to 0.053 (p < 10* 5 ). The frequency of heterozygotes was also reduced from 0.162 to 0.106 (p < 0.001) and not 
a single AccrS/ AccrS individual could be found (p < 0.01). 

[0103] Altogether, functional and statistical data suggest that CCR5 is indeed the major co-receptor responsible for 
natural infection by M-tropic HIV-1 strains. Individuals homozygous for the null Accr5 allele (about 1% of the Caucasian 
population) have apparently a strong resistance to infection. It Is unclear at this point whether resistance to HIV-1 is 
absolute or relative, and whether resistance will vary depending on the mode of viral contamination. Larger cohorts of 
seropositive individuals will have to be tested in order to clarify this point. Heterozygotes have a milder though significant 
advantage: assuming an equal probability of contact with HIV, it can be inferred from Table I that heterozygotes have 
a 39% reduction in their likeliness of becoming seropositive, as compared to individuals homozygous for the wild-type 
CCR5 allele. Both a decrease in functional CCR5 receptor number, and a dominant-negative effect of AccrS in vivo, 
comparable to what is observed In the in vitro experiments (Fig. 7) are possible explanations for this relative protection. 
The mutant allele, which can be regarded as a natural knock-out in human, is not accompanied by an obvious phenotype 
in homozygous individuals. Nevertheless, the lack of overt phenotype, taken together with the relative protection that 
characterises heterozygous subjects, suggests that pharmacological agents that selectively block the ability of HIV-1 
to utilise CCR5 as a cofactor, could be effective in preventing HIV-1 infection, and would be predicted not be associated 
with major side effects resulting from CCR5 inactivation. These pharmaceutical agents could be used with other com- 
pounds which are able to block other chemokine receptors used as co-receptors by some HIV-primary isolates in order 
to infect other cells [47]. The prevalence of the null allele in the Caucasian population raises the question of whether 
pandemia of HIV (or related viruses using the same co-receptor) have contributed during mankind's evolution to sta- 
bilise by selection the mutant ccrS allele at such a high frequency. 

Production of antibodies anti-CCR5 

[01 04] Antibodies were produced by genetic immunisation. Six week old females balb/c mice were used. DNA coding 
for the human CCRS receptor was inserted in the expression vector pcDNA3 under the control of the CMV promotor 
and 1 00 ug DNA was injected in the anterior tibial muscle, five days after pre-treatment of this muscle with cardiotoxine 
(from venom of Naja Nigricolis). Injections were repeated twice at three week intervals. Fifteen days after the last 
injection, blood was taken from each animal and sera were tested for the presence of anti-CCR5 antibodies. 
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Test of sera using Fluorescence Activated Cell Sorter (FACS) 

[0105] Sera were tested by fluorescence activated cell sorting using recombinant CHO cells expressing the CCR5 
receptor. Briefly, cells were detached using a PBS-EDTA-EGTA solution and incubated into PBS-BSA medium for 30 
minutes at room temperature with 5 u.l serum on the basis of 100,000 cells per tube. Cells were then washed and 
incubated for 30 minutes in ice together with anti-mouse antibody labelled with fluorescein. Cells were washed, taken 
up into 200 uJ of a PBS-BSA solution and fluorescence was analysed by FACS (FACSCAN, Becton-Oickinson). 1 0,000 
cells were counted. Wild type CHO or recombinant CHO cells expressing the human CCR2b receptor were used as 
controls. 

[0106] When tested by FACS analysis 2 weeks after the last injection (figure 9), all the sera from mice immunised 
with CCR5 cDNA, clearly recognised the native receptor expressed on CHO cells (mean of fluorescence = 200), without 
significant cross reaction with control cells expressing CCR2b (mean of fluorescence = 20). 

[0107] Sera were tested on either a CHO cell line expressing high level of CCR5 receptor (black histogram) or a 
CHO cell line expressing CCR2b receptor (white histogram) as negative control. Each serum was tested individually. 

Antibodies anti-CCRS and HIV infectivity 

[0108] Peripheral blood mononuclear cells (PBMC) from one donor homozygous from wild type CCR5 gene, were 
isolated and cultivated 3 days in presence of PHA. 

[0109] On day 4, 800 jit of cells (10 5 cells/ml) were incubated with 8 uJ of sera from mice immunised with CCR5 
cDNA, 30 minutes at 37 °C. 1 ml of viral solution (JRCSF HIV strain) is then added and incubated during 2 hours. Cells 
were then washed twice and cultivated during 15 days. 

[0110] Aliquot of medium is taken at days 0, 4, 7, 10 and 14 and the dosage of antigen p24 is performed. 
[0111] 14 days after the beginning of the experiment, one serum (serum B0) totally block the production of p24, 
indicating its ability to block the infection of the lymphocytes by this HIV strain (figure 1 0). Other serums also exhibit a 
partial or total effect on this infection (serum A2 and B1 ). All the other sera did not show any effect on this infection. 

Production of monoclonal antibodies 

[0112] Mice with the highest title of CCR5 antibodies were selected for monoclonal antibodies production and injected 
intravenously with 10 7 recombinant CHO-K1 cells expressing human CCR5 receptors. Three days later, animals were 
sacrificed and fusion of splenic cells or cells from lymph nodes near the site of injection with SP2/0 myeloma cells, 
were performed. Fusion protocol used was that of Galfre et al. (Nature 266, 550 (1 977)). A selective HAT (hypoxanthine/ 
aminopterin/thymidin) medium is used to select hybridomas and their supernatants are tested by FACS using recom- 
binant CHO cells expressing the human CCR5 receptor, as it was done for the sera. Positives hybridomas are then 
cloned by limited dilution. Clones that are shown positive by FACS analyses are then expanded and produced in ascites 
in balb/C mice. 
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SEQUENCE LISTING 



I) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: EURO SCREEN S.A. 

fB) STREET: Avenue des Becassines, 7- 

(C) CITY: BRUSSELS 

{ E) COUNTRY: BELGIUM 

(F) POSTAL CODE (ZIP) : 3-1160 

(A) NAME : SAMSON Michel 

(B) STREET: rue Victor Marquigny II 

(C) CITY: GENT ILLY 

{ E) COUNTRY: FRANCE 

(F) POSTAL CODE (ZIP) : F-94250 

(A) NAME: PARMSNTISR Marc 

(3) STREET: Chaussee d'Uccle, 304 

(C) CITY: LINKE3EEK 

( 2) COUNTRY: BELGIUM 

(?) POSTAL CODE (ZIP) : B-1604 

(A) NAME; VASSART Gilbert 

(3) STREET: Avenue Lambeau, 13 
(C) CITY: BRUSSELS 

(2) COUNTRY: BELGIUM 

(F) POSTAL CODE (ZIP): B-12Q0 

(Ai NAME: LIBERT Frederick 

(3) STREET: Avenue Grand- Peine, 21 
(C) CITY: BRAINS L ' ALLEUD 

( E) COUNTRY: BELGIUM 

(F) POSTAL CODE (ZIP) : B-1428 

(ii) TITLE OF INVENTION: ACTIVE AND INACTIVE CC-CHEMOXINES RECEPTOR 
AND NUCLEIC ACID MOLECULES ENCODING SAID RECEPTOR 

(iii) NUMBER OF SEQUENCES: 6 

tiv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS/MS-DOS 

(D) SOFTWARE: Patently Release #1.0, Version #1.25 (EPOJ 

(v) CURRENT APPLICATION DATA: 

APPLICATION NUMBER: WO PCT/3E97 / 00023 

INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 792 base pairs 
-:B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
<D> TOPOLOGY: linear 



iii) MOLECULE TYPE: DNA ( genomic) 



BP 1 199 360 A2 



i ix) FEATURE: 

(A) NAME /KEY : CDS 

(3) LOCATION: 240.. 791 



(xi) SEQUENCE DESCRIPTION: SEQ ID WO: 1: 

GAATTCCCCC AACAGAGCCA AGCTCTCCAT CTAGTGGACA GGGAAGCTAG CAGCAAACCT 60 

TCCCTTCACT ACAAAACTTC ATTGCTTGGC CAAAAAGAGA GTTAATTCAA TGTAGACATC 120 

TATGTAGGCA ATTAAAAACC TATTGATGTA TAAAACAGTT TGCATTCATG GAGGGCAACT 130 

AAATACATTC TAGGACTTTA TAAAAGATCA CTTTTTATTT ATGCACAGGG TGGAACAAG 23 9 

ATG GAT TAT CAA GTG TCA AGT CCA ATC TAT GAC ATC AAT TAT TAT ACA 237 
Me- Asp Tyr Gin Val Ser Ser Pro lie Tyr Asp lie Asn Tyr Tyr Thr 
1 5 10 15 

TCG GAG CCC TGC CAA AAA ATC AAT GTG AAG CAA ATC GCA GCC CGC CTC 335 
Ser Glu Pro Cys Gin Lys lie Asn Val Lys Gin lie Ala Ala Arg Leu 
20 25 30 

CTG CCT CCG CTC TAC TCA CTG GTG TTC ATC TTT GGT TTT GTG GGC AAC 333 
Leu Pro Pro Leu Tyr Ser Leu Val Phe He Phe Gly Phe Val Gly Asn 
35 40 45 

ATG CTG GTC ATC CTC ATC CTG ATA AAC TGC AAA AGG CTG AAG AGC ATG 431 
Met Leu Val lie Leu He Leu He Asn Cys Lys Arg Leu Lys Ser Met 
50 55 60 

ACT GAC ATC TAC CTG CTC AAC CTG GCC ATC TCT GAC CTG TTT TTC CTT 479 
Thr Asp He Tyr Leu Leu Asn Leu Ala He Ser Asp Leu Phe Phe Leu 
65 70 75 80 

CTT ACT GTC CCC TTC TGG GCT CAC TAT GCT GCC GCC CAG TGG GAC TTT 527 
Leu Thr Val Pro Phe Trp Ala His Tyr Ala Ala Ala Gin Trp Asp Phe 
85 90 95 

GGA AAT ACA ATG TGT CAA CTC TTG ACA GGG CTC TAT TTT ATA GGC TTC 575 
Gly Asn Thr Met Cys Gin Leu Leu Thr Gly Leu Tyr Phe He Gly Phe 
100 105 110 

TTC TCT GGA ATC TTC TTC ATC ATC CTC CTG ACA ATC GAT AGG TAC CTG 623 
Phe Ser Gly He Phe Phe He He Leu Leu Thr He Asp Arg Tyr Leu 
115 120 125 

GCT GTC GTC CAT GCT GTG TTT GCT TTA AAA GCC AGG ACG GTC ACC TTT 671 
Ala Val Val His Ala Val Phe Ala Leu Lys Ala Arg Thr Val Thr Phe 
130 135 140 

GGG GTG GTG ACA AGT GTG ATC ACT TGG GTG GTG GCT GTG TTT GCG TCT 719 
Gly Val val Thr Ser Val lie Thr Trp Val Val Ala Val Phe Ala Sar 
-45 150 155 160 

CTC CCA GGA ATC ATC TTT ACC AGA TCT CAA AAA GAA GGT CTT CAT TAC 7 57 

Leu Pro Gly He He Phe Thr Arg Ser Gin Lys Glu Gly Leu His Tyr 
155 170 175 

ACC TGC AGC TCT CAT TTT CCA TAC A 7 52 

Thr Cys Ser Ser His Phe Pro Tyr 
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130 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
(AJ LENGTH: 134 amino acids 
(3) TYPS: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: procein 

txi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Asp Tyr Gin Val Ser Sar Pro lie Tyr Asp lie Asn Tyr Tyr Thr 
15 10 15 

Ser Glu Pro Cys Gin Lys lie Asn Val Lys Gin lie Ala Ala Arg Leu 
20 25 30 

Leu Pro Pro Leu Tyr Ser Leu Val Phe lie Phe Gly Phe Val Gly Asn 
35 40 45 

Met Leu Val lie Leu lie Leu lie Asn Cys Lys Arg Leu Lys Ser Met 
50 55 50 

Thr Asp lie Tyr Leu Leu Asn Leu Ala He Ser Asp Leu Phe Phe Leu 
65 70 75 80 

Leu Thr Val Pro Phe Trp Ala His Tyr Ala Ala Ala Gin Trp Asp Phe 
85 90 95 

Gly Asn Thr Met Cys Gin Leu Leu Thr Gly Leu Tyr Phe He Gly Phe 
100 105 110 

Phe Ser Gly He Phe Phe He lie Leu Leu Thr He Asp Arg Tyr Leu 
115 120 125 

Ala Val Val His Ala Val Phe Ala Leu Lys Ala Arg Thr Val Thr Phe 
130 135 140 

Gly Val Val Thr Ser Val He Thr Trp Val Val Ala Val Phe Ala Ser 
145 150 155 ISO 

Leu Pro Gly He He Phe Thr Arg Ser Gin Lys Glu Gly Leu His Tyr 
165 170 175 

Thr Cys Ser Ser His Phe Pro Tyr 
180 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1477 base pairs 
(B> TYPE: nucleic acid 
{C> STRAMDEDNESS: single 
(D) TOPOLOGY: linear 

Ui) MOLECULE TYPE: DNA (genomic) 
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fix} FEATURE: 

(A} NAME /KEY : CDS 

(B) LOCATION: 240.. 1295 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GAATTCCCCC AACAGAGCCA AGCTCTCCAT CTAGTGGACA GGGAAGCTAG CAGCAAACCT 60 

TCCCTTCACT ACAAAACTTC ATTGCTTGGC CAAAAAGAGA GTTAATTCAA TGTAGACATC 12 0 

TATGTAGGCA ATTAAAAACC TATTGATGTA TAAAACAGTT TGCATTCATG GAGGGCAACT 180 

AAATACATTC TAGGACTTTA TAAAAGATCA CTTTTTATTT ATGCACAGGG TGGAACAAG 23 9 

ATG GAT TAT CAA GTG TCA AGT CCA ATC TAT GAC ATC AAT TAT TAT ACA 28? 
Met Asp Tyr Gin Val Ser Ser Pro lie Tyr As? He Asn Tyr Tyr Thr 
15 10 15 

TCG GAG CCC TGC CAA AAA ATC AAT GTG AAG CAA ATC GCA GCC CGC CTC 335 
Ser Glu Pre Cys Gin Lys lie Asn Val Lys Gin He Ala Ala Arg Leu 
20 25 30 

CTG CCT CCG CTC TAC TCA CTG GTG TTC ATC TTT GGT TTT GTG GGC AAC 3 83 

Leu Pro Pre Leu Tyr Ser Leu Val Phe Ha Phe Gly Phe Val Gly Asn 
35 40 45 

ATG CTG GTC ATC CTC ATC CTG ATA AAC TGC AAA AGG CTG AAG AGC ATG 431 
Met Leu Val He Leu He Leu He Asn Cys Lys Arg Leu Lys Ser Met 
50 55 60 

ACT GAC ATC TAC CTG CTC AAC CTG GCC ATC TCT GAC CTG TTT TTC CTT 479 
Tar Asp He Tyr Leu Leu Asn Leu Ala He Ser Asp Leu Phe Phe Leu 
55 70 75 80 

CTT ACT GTC CCC TTC TGG GCT CAC TAT GCT GCC GCC CAG TGG GAC TTT 527 
Leu Thr Val Pro Phe Trp Ala His Tyr Ala Ala Ala Gin Trp Asp Phe 
85 90 95 

GGA AAT ACA ATG TGT CAA CTC TTG ACA GGG CTC TAT TTT ATA GGC TTC 57 5 

Gly Asn Thr Met Cys Gin Leu Leu Thr Gly Leu Tyr Phe He Gly Phe 
100 105 110 

TTC TCT GGA ATC TTC TTC ATC ATC CTC CTG ACA ATC GAT AGG TAC CTG 623 
Phe Ser Gly He Phe Phe He He Leu Leu Thr He Asp Arg Tyr Leu 
115 120 125 

GCT GTC GTC CAT GCT GTG TTT GCT TTA AAA GCC AGG ACG GTC ACC TTT 671 
Ala Val Val His Ala Val Phe Ala Leu Lys Ala Axg Thr Val Thr Phe 
130 135 140 

GGG GTG GTG ACA AGT GTG ATC ACT TGG GTG GTG GCT GTG TTT SCG TCT 719 
Gly Val Val Thr Ser Val Ha Thr Trp Val Val Ala Val Phe Ala Ser 
145 150 155 160 

CTC CCA GGA ATC ATC TTT ACC AGA TCT CAA AAA GAA GGT CTT CAT TAC 7 67 

Leu Pro Gly He He Phe Thr Arg Ser Gin Lys Glu Gly Leu His Tyr 
165 170 175 

ACC TGC AGC TCT CAT TTT CCA TAC AGT CAG TAT CAA TTC TGG AAG AAT 315 
Thr Cys Ser Ser His Phe Pro Tyr Ser Gin -Tyr Gin Phe Trp Lys Asn 
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180 185 190 

TTC CAG ACA TTA AAG ATA GTC ATC TTG GGG CTG GTC CTG CCG CTG CTT 863 
Phe Gin Thr Leu Lys He Val lie Leu Gly Leu Vai Leu Pro Leu Leu 
195 200 205 

GTC ATG GTC ATC TGC TAC TCG GGA ATC CTA AAA ACT CTG CTT CGG TGT 91 1 

Val Me: Val He Cys Tyr Ser Gly He Leu Lys Thr- Leu Leu Arg Cys 
210 215 220 ,« 

CGA AAT GAG AAG AAG AGG CAC AGG GCT GTG AGG CTT ATC TTC ACC ATC 959 
Arg Asn Glu Lys Lys Arg His Arg Ala Val Arg Leu He Phe Thr He 
225 230 235 240 

ATG ATT GTT TAT TTT CTC TTC TGG GCT CCC TAC AAC ATT GTC CTT CTC 1007 
Met He Val Tyr Phe Leu Phe Trp Ala Pro Tyr Asn lie Val Leu Leu 
245 250 255 

CTG AAC ACC TTC CAG GAA TTC TTT GGC CTG AAT AAT TGC AGT AGC TCT 1055 
Leu Asn Thr Phe Gin Glu Phe Phe Gly Leu Asn Asn Cys Ser Ser Ser 
260 255 270 

AAC AGG TTG GAC CAA GCT ATG CAG GTG ACA GAG ACT CTT GGG ATG ACG 1103 
Asn Arg Leu Asp Gin Ala Met Gin Val Thr Glu Thr Leu Gly Met Thr 
275 280 285 

CAC TGC TGC ATC AAC CCC ATC ATC TAT GCC TTT GTC GGG GAG AAG TTC 1151 
His Cys Cys He Asn Pro lie He Tyr Ala Phe Val Gly Glu Lys Phe 
290 295 300 

AGA AAC TAC CTC TTA GTC TTC TTC CAA AAG CAC ATT GCC AAA CGC TTC 1199 
Arg Asn Tyr Leu Leu Val Phe Phe Gin Lys His He Ala Lys Arg Phe 
305 310 315 320 

TGC AAA TGC TGT TCT ATT TTC CAG CAA GAG GCT CCC GAG CGA GCA AGC 1247 
Cys Lys Cys Cys Ser He Phe Gin Gin Glu Ala Pro Glu Axg Ala Ser 
325 330 335 

TCA GTT TAC ACC CGA TCC ACT GGG GAG CAG GAA ATA TCT GTG GGC TTG 1295 
Ser Val Tyr Thr Arg Ser Thr Gly Glu Gin Glu He Ser Val Gly Leu 
340 345 350 

TGACACGGAC TCAAGTGGGC TGGTGACCCA GTCAGAGTTG TGCACATGGC TTAGTTTTCA 1355 

TAC AC AGC CT GGGCTGGGGG TNGGTTGGNN GAGGTCTTTT TTAAAAGGAA GTTACTGTTA 1415 

TAGAGGGTCT AAGATTCATC CATTTATTTG GCATCTGTTT AAAGTAGATT AGATCCC-AAT 1475 

TC 1477 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 352 amino acids 
(3) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

fsci) SEQUEMCE DESCRIPTION: SEQ ID NO: 4: 



21 



EP 1 199 360 A2 



Met Asp Tyr Gin Val Ser Ser Pro lie Tyr Asp lie Asn Tyr Tyr Thr 
15 10 15 

Ser Glu Pro Cys Gin Lys lie Asn Val Lys Gin lie Ala Ala Arg Leu 
20 25 30 

Leu Pro Pro Leu Tyr Ser Leu Val phe lie Phe Gly Phe Val Gly Asn 
3 5 40 . •■' 45 

Mec Leu Val lis Leu lie Leu lie Asn Cys Lys Arg Leu Lys Ser Mec 
50 55 " 60 

Thr Asp He Tyr Leu Leu Asn Leu Ala lie Ser Asp Leu Phe Phe Leu 
65 70 75 80 

Leu Thr Val Pro Phe Trp Ala His Tyr Ala Ala Ala Gin Trp Asp Phe 
35 90 95 

Gly Asn Thr Mec Cys Gin Leu Leu Thr Gly Leu Tyr Phe lie Gly Phe 
100 105 110 

Phe Ser Gly lie Phe Phe lie lie Leu Leu Thr lie Asp Arg Tyr Leu 
115 120 125 

Ala Val Val His Ala Val Phe Ala Lau Lys Ala Arg Thr Val Thr Phe 
130 135 140 

Gly Val val Thr Ser Val Xla Thr Trp Val Val Ala Val Phe Ala Ser 
145 150 155 160 

Leu Pro Gly lie lie Phe Thr Arg Ser Gin Lys Glu Gly Leu His Tyr 
165 170 175 

Thr Cy3 Ser Ser His Phe Pro Tyr Ser Gin Tyr Gin Phe Trp Lys Asn 
180 185 190 

Phe Gin Thr Leu Lys lie Val He Leu Gly Leu Val Leu Pro Leu Leu 
195 200 205 

Val Met Val II a Cys Tyr Ser Gly lie Leu Lys Thr Leu Leu Arg Cys 
210 215 220 

Arg Asn Glu Lys Lys Arg His Arg Ala Val Arg Leu He Phe Thr He 
225 230 235 240 

Met lie Val Tyr Phe Leu Phe Trp Ala Pro Tyr Asn He Val Leu Leu 
245 250 255 

Leu Asn Thr Phe Gin Glu Phe Phe Gly Leu Asn Asn Cys Ser Ser Ser 
260 265 270 

Asn Arg Leu Asp Gin Ala Mec Gin Val Thr Glu Thr Leu Gly Met Thr 
275 280 285 

His Cys Cys He Asn Pro He lis Tyr Ala Phe Val Gly Glu Lys Phe 
290 295 300 

Arg Asn Tyr Leu Leu Val Phe Phe Gin Lys His He Ala Lys Arg Phe 
305 310 315 320 
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Cys Lys Cys Cys Ser lie Phe Gin Gin Glu Ala Pro Glu Arg Ala Ser 
325 330 335 

Ser Val Tyr Thr Arg Ser Thr Gly Glu Gin Glu lie Ser Val Gly Leu 
340 345 350 



(2) INFORMATION FOR SEQ ID NO: 5 : 

W (iJ SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1442 base pairs 
«B) TYPE: nucleic acid 
(C> STRANDSDNESS : Single 
(D) TOPOLOGY: linear 

15 (iii MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
20 ( B) LOCATION: 240.. 884 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GAATTCCCCC AACAGAGCCA AGCTCTCCAT CTAGTGGACA GGGAAGCTAG CAGCAAACCT SO 

25 

TCCCTTCACT ACAAAACTTC ATTGCTTGGC CAAAAAGAGA GTTAATTCAA TGTAGACATC 120 

TATGTAGGCA ATTAAAAACC TATTGATGTA TAAAACAGTT TGCATTCATG GAGGGCAACT 130 

3Q AAATACATTC TAGGACTTTA TAAAAGATCA CTTTTTATTT ATGCACAGGG TGGAACAAG 239 

ATG GAT TAT CAA GTG TCA AGT CCA ATC TAT GAC ATC AAT TAT TAT AC A 287 
Met Asp Tyr Gin Val Ser Ser Pro He Tyr Asp lie Asn Tyr Tyr Thr 
- 5 10 15 

35 TCG GAG CCC TGC CAA AAA ATC AAT GTG AAG CAA ATC GCA GCC CGC CTC 33 5 

Ser Glu Pro Cys Gin Lys He Asn Val Lys Gin He Ala Ala Arg Leu 
20 25 30 

CTG CCT CCG CTC TAC TCA CTG GTG TTC ATC TTT GGT TTT GTG GGC AAC 383 
.„ Leu Pro p ro Leu Tyr Ser Leu Val Phe He Phe Gly Phe Val Gly Asn 

40 3 5 40 - 45 

ATG CTG GTC ATC CTC ATC CTG ATA AAC TGC AAA AGG CTG AAG AGC ATG 431 
Met Leu Val He Leu He Lau He Asn Cys Lys Arg Leu Lys Ser Met 
50 55 60 

45 

ACT GAC ATC TAC CTG CTC AAC CTG GCC ATC TCT GAC CTG TTT TTC CTT 479 
Thr Asp He Tyr Leu Leu Asn Leu Ala He Ser Asp Leu Phe Phe Leu 
65 70 75 30 

CTT ACT GTC CCC TTC TGG GCT CAC TAT GCT GCC GCC CAG TGG GAC TTT 527 
Leu Thr Val Pro Phe Trp Ala His Tyr Ala Ala Ala Gin Trp Asp Phe 
S5 90 95 

GGA AAT ACA ATG TGT CAA CTC TTG ACA GGG CTC TAT TTT ATA GGC TTC 575 
Gly Asn Thr Met Cys Gin Leu Leu Thr Gly Leu Tyr Phe He Gly Phe 

55 100 105 no 
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TTC TCT GGA ATC TTC TTC ATC ATC CTC CTG AC A ATC GAT AGG TAC CTG 623 
Phe Ser Gly He Phe Phe He lie Leu Leu Thr He Asp Arg Tyr Leu 
115 120 125 

GCT GTC GTC CAT GCT GTG TTT GCT TTA AAA GCC AGG ACG GTC ACC TTT 671 
Ala Val Val Ki3 Ala Val Phe Ala Leu Lys Ala Arg Thr Val Thr Phe 
130 135 140 

GGG GTG GTG ACA AGT GTG ATC ACT TGG GTG GTG GCT G?G TTT GCG TCT 719 
Gly Val Val Thr Ser Val He Thr Trp Val Val Ala Val Phe Ala Ser 
145 150 155 160 

CTC CCA GGA ATC ATC TTT ACC AGA TCT CAA AAA GAA GGT CTT CAT TAC 767 
Leu Pro Gly He He Phe Thr Arg Ser Glr* Lys Glu Gly Leu His Tyr 
165 170 175 

ACC TGC AGC TCT CAT TTT CCA TAC ATT AAA GAT AGT CAT CTT GGG GCT 815 
Thr Cys Ser Ser His Phe Pro Tyr He Lys Asp Ser His Leu Gly Ala 
180 185 190 

GGT CCT GCC GCT GCT TGT CAT GGT CAT CTG CTA CTC GGG AAT CCT AAA 363 
Gly Pro Ala Ala Ala Cys His Gly His Leu Leu Leu Gly Asn Pro Lys 
195 200 205 

AAC TCT GCT TCG GTG TCG AAA TGAGAAGAAG AGGCACAGGG CTGTGAGGCT 914 
Asn Ser Ala Ser Val Ser Lys 
210 215 

T ATC TTC ACC ATCATGATTG TTT ATT TTC T CTTCTGGGCT CCCTACAACA TTGTCCTTCT 974 

CCTGAACACC TTCCAGGAAT TCTTTGGCCT GAATAATTGC AGTAGCTCTA ACAGGTTGGA 1034 

CCAAGCTATG CAGGTGACAG AGACTCTTGG GATGACGCAC TGCTGCATCA ACCCCATCAT 1094 

CTATGCCTTT GTCGGGGAGA AGTTC AG AAA C TAC CTC TTA GTCTTCTTCC AAAAGCACAT 1154 

TGCCAAACGC TTCTGCAAAT GCTGTTCTAT TTTCCAGCAA GAGGCTCCCG AGCGAGCAAG 1214 

CTCAGTTTAC ACCCGATCCA CTGGGGAGCA GGAAATATCT GTGGGCTTGT GACACGGACT 1274 

CAAGTGGGCT GGTGACCCAG TCAGAGTTGT GCACATGGCT TAGTTTTCAT AC AC AGC CTG 1334 

GGCTGGGGGT GGTTGGGAGG TCTTTTTTAA AAGGAAGTTA CTGTTATAGA GGGTCTAAGA 1394 

TTC ATC C ATT TATTTGGCAT CTGTTTAAAG TAGATTAGAT CCGAATTC 1442 

(2) INFORMATION FOR SSQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 215 amino acids 
(3) TYPE: amino acid 
(D) TOPOLOGY: linear 

<ii> MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID WO: 6: 

Me*: Asp Tyr Gin Val Ser Ser Pro He Tyr Asp He Asn Tyr Tyr Thr 
15 10 15 
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Ser GIu Pro Cys Gin Lys He Asn Val Lys Gin He Ala Ala Arg Leu 
20 25 30 

Leu Pro Pro Leu Tyr Ser Leu Val Phe He Phe Gly Phe Val Gly Asn 
35 40 45 

Met Leu Val He Leu He Leu He Asn Cys Lys Arg Leu Lys Ser Met 
50 55 , 60 

Thr Asp He Tyr Leu Leu Asn Leu Ala He Ser Asp Leu Phe Phe Leu 
65 70 75 80 

Leu Thr Val Pro Phe Trp Ala His Tyr Ala Ala Ala Gin Trp Asp Phe 
85 90 95 

Gly Asn Thr Met Cys Gin Leu Leu Thr Gly Leu Tyr Phe He Gly Phe 
100 105 110 

Phe Ser Gly He Phe Phe He He Leu Leu Thr He Asp Arg Tyr Leu 
115 120 125 

Ala Val Val His Ala Val Phe Ala Leu Lys Ala Arg Thr Val Thr Phe 
130 135 140 

Gly Val Val Thr Ser Val He Thr Trp Val Val Ala Val Phe Ala Ser 
145 150 155 160 

Leu Pro Gly He He Phe Thr Arg Ser Gin Lys Glu Gly Leu His Tyr 
165 170 175 

Thr Cys Ser Ser Hi3 Phe Pro Tyr He Lys Asp Ser His Leu Gly Ala 
180 185 190 

Gly Pro Ala Ala Ala Cys His Gly His Leu Leu Leu Gly Asn Pro Lys 
195 200 205 

Asn Ser Ala Ser val Ser Lys 
210 215 



Claims 

1. Peptide having at least an amino acid sequence which presents more than 80% homology with the amino acid 
sequence as represented in SEQ ID NO. 3 shown in figures 1 . 

2. Peptide according to the claim 1 , having at least an amino acid sequence which presents more than 90% homology 
with the amino acid sequence as represented in SEQ ID NO. 3 shown in figures 1 . 

3. Peptide according to the claim 1 or 2 having the amino acid sequence of SEQ ID NO. 3 or a portion thereof. 

4. Peptide according to any of the claims 1 to 3, characterised In that it is an inactive CC chemokine receptor. 

5. Peptide according to the claim 4, characterised in that the receptor is not stimulated by the MIP-1 {3 chemokine 
at a concentration less or equal to 10 nM. 

6. Peptide according to the claim 4 or 5 t characterised in that the receptor is not stimulated by the MIP-1 a or 
RANTES chemokines. 
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7. Peptide according to any of the claims 4 to 6, characterised in that trie receptor is not stimulated by the MCP-1 , 
MCP-2, MCP-3, IL-8 and GROcc chemokines. 

8. Peptide according to any of the claims 1 to 7, characterised in that it is not a receptor of HIV-1 and/or HIV-2 
viruses or a portion of said HIV viruses. 

9. Peptide according to any of the claims 1 to 8, being a human receptor. 

10. Nucleic acid molecule having more than 80%, preferably more than 90%, homology with the nucleic acid sequence 
SEQ ID NO. 3 shown in figure 1 . 

1 1 . Nucleic acid molecule according to the claim 1 0, which has at least the nucleic acid sequence SEQ ID NO. 3 shown 
in figure 1 or a portion (such as a probe or a primer) thereof encoding the peptide according to any of the preceding 
claims 1 to 9. 

1 2. Nucleic acid probe comprising a nucleic acid molecule of at least 1 5 nucleotides capable of specifically hybridising 
with an unique sequence included within the nucleic acid molecule according to claim 10 or 11. 

13. Antibody directed to an epitope of the peptide according to any of the claims 1 to 9. 

14. Method for the identification of a patient which present a predisposition or a resistance to infectious agents, which 
comprises the steps of obtaining a biological sample from a patient and identifying if said patient comprises cells 
expressing the peptide according to any of the preceding claims 1 to 9, or cells comprising nucleic acid molecules 
according to the claim 1 0 or 11 , or comprises an antibody according to the claim 1 3, and identifying if said patient 
may present a predisposition or a resistance to infectious agents. 

15. Method according to the claim 14, wherein said infectious agents are the HIV-1 and/or the HIV-2 viruses. 

16. Method according to the claim 14 or 15, wherein said method comprises the step of detecting the expression of 
the peptide according to any of the preceding claims 1 to 9 by detecting the presence of mRNA encoding said 
receptor, which comprises obtaining total mRNA from a cell and contacting the mRNA so obtained with the nucleic 
acid probe according to the claim 1 2 under hybridising conditions and detecting the presence of mRNA hybridised 
to the probe, thereby detecting the expression of the peptide by the cell. 

1 7. Method according to the claim 1 4 or 1 5, which comprises the step of detecting the presence of the peptide according 
to any of the preceding claims 1 to 9 on the surface of a cell which comprises contacting said cell with the antibody 
according to the claim 13 under conditions permitting binding of the antibody to the peptide and detecting the 
presence of the antibody bound to the cell, thereby detecting the presence of the peptide on the surface of the cell. 

1 8. Method for diagnosing a predisposition or a resistance to a disorder associated with the activity of a specific allele 
of the peptide according to any of the claims 1 to 9, and/or associated with infectious agents, preferably the HIV- 
1 and/or HIV-2 viruses, present in a subject, which comprises : 

a) obtaining nucleic acid molecules encoding said peptide from the cells of the subject, 

b) possibly performing a restriction digest of said nucleic acid molecules with a panel of restriction enzymes, 

c) possibly electrophoretically separating the resulting nucleic acid fragments on a sized gel, 

d) contacting the resulting gel or the obtained nucleic acid molecules labelled with a nucleic acid probe with 
a detectable marker and capable of specifically hybridising to said nucleic acid molecule, 

e) detecting labelled bands or in situ nucleic acid molecules which have hybridised to the said nucleic acid 
molecule labelled with a detectable marker to create a unique band pattern or in situ marking specific to the 
subject, 

f) preparing other nucleic acid molecules encoding said peptide obtained from the cells of other subjects for 
diagnosis by step a-e, and 

g) comparing the unique band pattern specific to the nucleic acid molecule of subjects suffering from the 
disorder from step e and the nucleic acid molecule obtained for diagnosis from step f to determine whether 
the patterns are the same or different and to diagnose thereby predisposition or resistance to the disorder if 
the patterns are the same or different. 
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19. Method for diagnosing a predisposition or a resistance to a disorder associated with the activity of a specific allele 
of the peptide according to any of the claims 1 to 9 or the presence of said peptide at the surface of cells, and/or 
associated with infectious agents, preferably the HIV and/or HIV-2 viruses, present in a subject, which comprises : 

a) obtaining a sample of a body fluid, preferably a blood sample comprising antigen presenting cells, from a 
subject, 

b) adding to said sample the antibody according to claim 13, 

c) detecting the cross-reaction between said antibody and the specific peptide, and 

d) determining whether the peptide corresponds to a receptor or an inactive receptor and diagnosing thereby 
a predisposition or a resistance to the disorder according to the type of the peptide present in the body fluid 
of the subject. 

20. Diagnostic and/or dosage device comprising the peptide according to any of the claims 1 to 9, the nucleic acid 
molecule according to the claim 1 0 or 1 1 , the nucleic acid probe according to the claim 12, the antibody according 
to the claim 13 and/or a mixture thereof, being possibly labelled with a detectable marker. 

21. Diagnostic and/or dosage device according to the claim 20, characterised in that it comprises the reactants for 
the detection and/or dosage of antigens, antibodies or nucleic acid sequences through a method selected from 
the group consisting of in situ hybridisation, hybridisation or recognition by marked specific antibodies, specially 
ELISA ® (Enzyme Linked Immunosorbent Assay) or RIA® (Radio Immunoassay), methods on filter, on a solid 
support, in solution, in "sandwich", on gel, by Dot blot hybridisation, by Northern blot hybridisation, by Southern 
blot hybridisation, by isotopic or non-isotopic labelling (such as immunofluorescence or biotinylation), by a tech- 
nique of cold probes, by genetic amplification, particularly PCR, LCR, NASBA or CPR, by a double immunodiffu- 
sion, by a counter-immunoelectrophoresis, by haemagglutination and/or a mixture thereof. 
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5EQ ID NO. 1 

Fig. 1 a 



GAATTCCCCCAACAGAGCCAAGCTCTCCATCTACT 5 9 

19 

TTCCCTTCACTACAAAACrTCA. 1 UC' i I'GGCCAAAAAGAGAGTTAATTCAATTn^aGACAX 119 

39 

CTATGTAGGCAATTAAAAACCTATT^ 179 

* " 59 

TAAA7ACAV1 L 1AGGAC ITIATAAAAGATCAl ill 1"IATTTATGCACAGGGTGGAACAAG 239 

79 

ATGGATTATOUGTGTCAAGTCCAATC^ 2 99 

MDYQVSSPIYD INYYTSSPC 99 

CAAAAAATCAATGTGAAGCAAATCGCAGCCCGCCTCCTG^ 359 

Q ICINVKQIAARLLP PLYSLV 119 

rrcATCirrtxrnrrrGTGGGCAACATC 4 19 

FIFGFVG NMLVILILINCXR 139 

CTGAAGAGCATCACTC^ICATCTAC 479 

LK'SMTDZYLLN&AXSOLFP 159 

CITACTGTCCCCrirTSGGCTCACrATO 53 9 

L TV P FWAHYAAAQWD FGNTM 179 

iVIXJAACTCTTGAGlGGGCTCTATri"rATaGGU l'lXJTL'CTCTOGAA 1 C x 1 L I'l LATCATC 599 

CQLLTGLYFIGFFSGIFFII 139 

CTCCTGACAATCGATAGGTACCTGGCTGTOTTC €5 9 

LLTIDRYLAVVH AVFALKAR 219 

ACGGTCAC LU ^ ritj GGGTGGTt^CAAGTCTGArCACITGGGTG G 719 

TVTFGVVTSVITWVVAVFAS 239 

CTCCCAGGAATCATCTTTACCAGATCTCAAAAAGAAGOTCTTC^ 779 

LPGIIFTRSQKSGLHYTCSS 259 

CATTTTC CAXACA 
H F P Y 
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GAATTCCCCCAACAGAGCCAAOriCTCaTCTAG 59 

19 

TTCCCTI^CTAaUAACTra 119 

39 

CTATTFIAGGCAATTAAAAACCTATTGATCTAT^ 17 9 

59 

TAAATACATTCTAGGAGTTTATAAAAGATCACCT 23 9 

79 

AraSATIATCAA£rn?rCaACTCCAATCT^ 299 

MDYQVSSPIYDIKYYTSSPC 99 

CAAAAAATCAATOTX^UGCAAATCGCA£^ 3S9 

QKINVKQ IAARLLPSLYSLV 113 

TTCATLUTIt^nTlVll^SGC^ 419 

PIFGF VG3TMLVILILINCKR 139 

CTGAAGAGCATGACTSACATCTA£CTCC^ 4 79 

LKSMTD r YLLNLAISDL FPL 1S9 

CTTACItrrCCCCrrCTOG^TTCACTATG^ 539 

LTVPFWAHYAAAQWDFGHTM 179 

TGTCAACTCTTCACAGGG CTLTA ITITATAGG Cl^^ 599 

CQLLT GLYFIGFFSGIF FI I 199 

C l C L-l ^ CAATCgATAGGTftCCra3CTGTCGTCCATG Cl 559 

LLTIDaYLAVVHAVFALKAR 219 

AOGGTCAC l I ' l^Tj GGGTGGTGACAAgTGTGATCACrTGGGTCGTSG O 'lU'llj m UO JIVT 7X9 

TVTFGVVTSVITWVVAVFAS 239 

CTCCCAgGAATCATCTTTACCAGATCTCAjWUUSAA LjU 11 riXJA TTACACCTCjCAGCrCT 779 

LPGIIFTRSQKBGLHYTCSS 259 

CATTTTCCATACAGTCAGTATCAATTCTGGAAGAATTTC 83 9 

HFPYSQYQFWKNFQTLK IVI 273 



SSQ ID NO. 2 FIG. lb 
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TTGGGGCTGGTCCTGCCGCIUC 1UU I CATCGTC&TCTGCTACTCGGGAATCCTAAAAACT 399 

uGLVLPLLVMVICYSGILKT 299 

ClTieTT CGGTTGTCGAAATGAGAAGA^^ 1VACCATC 959 

L L R CRNBKKRHRAVRL IPTI 319 

ATGA lU^lTl ' AriTlVICn ' CXtS GGCTCCCTACAA Ql 'r Ol 1 ^ 1019 

MIVYFLPWAPYHIVLLLNTF 339 



CAGGAATTCTTTGGCCITJAATAATTGCAfflAGCTC^ 1079 

QSFFGLNtfCSSStfRLDQAMQ 359 

GTGACAGAGA CUVriXiG GAroACGCA ClTjClXj CATCA 113 9 

VTSTLGMTKCCINPIIYAPV 379 

GGGGAGAAtil'lXIAGAAACTACClVrrAbl L 1TL I'lXXAAAAGCACATTCCCAiUlCGCTTC 1199 

GSKPRNrLLVFFQKHIAKRP 399 

TrauArarrcTTa^Trra 1259 

CKCCSIFQQ B A P 3 R A S SVYT 419 

a»TCCACTGGGGAGCAGGAAATATCTGTGGGCri^ 1319 

KSTGHQSISVGI.* 439 

GACCCAGTCAGAGTTGTtKIACATGGCTXAGT^ 1379 

459 

479 

TATTTGGCATCIGTTIAAWTAGATtAGATCCGAATTC 



SHQ ID NO ,2 (SUITE) 



FIG.lc 



30 



EP1 199 360 A2 



GAATTCCCCCAACAGAGCOUGCTCTCCATCT^ 59 

19 

TTCCCrrCACI3lCAAAACTT»^ 119 

39 

CIRTtrrAGCyi&ATTAAAAACCTATTC 179 

59 

TAAATACATTCTAGGACTTTATAAAJ^^ 239 

79 

ATOGATTATCAAGTGTCAAGTCCAATCTATT^^^ 2 99 

MDYQVSSPIYDIHYYTS3PC 99 

CIAAAAATCAATGTTSUGCAAATCGC^ 3 59 

QKIirVJCQIAAHtiLPPLYSIiV 119 

TTCA TLTriWin^ ' lVlX^ GCAACATGCT G G'rCAXCCT^ 419 

PIFGF VGNMLVILILINCX* 139 

CTCAAGAGCATCACIXSACATCDICCTGCTCAACCTGGCCA TCIV 479 

LKSMTDIYI*LNLAISD L F F L 159 

CTTk C 1V> IC CC LTrL M IU GGCTCACI*TCCT 539 

LTVPFWAHYAAAQWDFGNTM 179 

TGTCUtCTCTrGACaGGGCTCI ^ VLX TCXTQ 599 

CQLLTGLYF IGFFSGIFFI I 199 

CTCCTCACAATCGATAGgTACCTGG C T G T CG T C CATG ClG 1 G l ' 1 ' lWl ' J. ' UA AAAGCCAGG 65 9 

LLTIDRYLAVVHAVPALXAR 219 

ACGGTCA CCUVlXiG G GlKGm AC^ 719 

TVTFGVVTSVITWVVAVFAS 239 

CTCCCAGGAATCATCn TACCAGATCTCAAAAAGAAGGTCTT^TTACACCTGCAGCTCT 779 

LPGIIFTRSQXSGLHYTCSS 259 

CATTTTCCATACATTAAAGATAGTCATCTTGGGGCTGG'r C Cr G CCG C PUCrUi ' lXlA TGGT 339 

HFPYIKDS HLGAGPAAACKG 279 



SZQ ID NO. 3 FIG-Id 
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CATC^C^ 899 
CAGGGCTCTGAGGCrTATCTTCACa 959 



_<^CATTCTCCTTCTCCIGAA^ 



319 

1019 
339 



359 

CATCAACCCaTCATCTATt^CTTIl^^ 



CTTCCAAAAGQlCATTCCCAAACGCTTCTGCAAATG C T Li 1 1LUII I ' lTC CAGCAAGAGGC 
TCCCGAGCGAGCaAGCTCAGrnACACCCGATCCACItX^ 
^TTnTX3ACACGGACTCAAGTGGGCTC 

GTTATAGAGGGTCTAAGATTCATCCATTTATrTCGCATC^ 
GAAtTC 



1139 
379 

1199 
399 



CTGTCGG 1259 
419 



1319 
439 

1379 
459 

1439 
479 



SEQ ID NO. 3 (SUITE) 
riG.le 
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